Tibetan speech synthesis based on an improved neural network

Abstract

Tibetan speech synthesis based on neural networks has become the mainstream method. Among the available vocoders, the Griffin-Lim vocoder is widely used because its synthesis is relatively simple. To address the low fidelity of that vocoder, this paper uses WaveNet instead for synthesis. The model first applies a convolution operation and an attention mechanism to extract sequence features. A linear projection and a feature amplification module then predict the mel spectrogram. Finally, WaveNet synthesizes the waveform. Experimental data show that the model achieves better synthesis performance.
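The feature-extraction and projection stages named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' model: the layer sizes, the random weights, the identity query/key/value projections, and the function names (`conv1d_same`, `self_attention`, `predict_mel`) are all assumptions made for illustration. The feature amplification module and the WaveNet vocoder are not specified in the abstract and are left out.

```python
import numpy as np

def conv1d_same(x, kernel):
    """Sliding-window 1D convolution along time with 'same' zero padding
    (cross-correlation, as in most deep-learning frameworks).
    x: (T, C_in), kernel: (K, C_in, C_out) with odd K."""
    k = kernel.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    # Stack K time-shifted views into shape (T, K, C_in).
    windows = np.stack([xp[i:i + x.shape[0]] for i in range(k)], axis=1)
    return np.einsum("tkc,kco->to", windows, kernel)

def self_attention(h):
    """Scaled dot-product self-attention; Q, K and V are all h (assumption)."""
    scores = h @ h.T / np.sqrt(h.shape[1])
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ h, weights

def predict_mel(embedded, kernel, proj):
    """Convolution + ReLU, self-attention, then a linear projection to mel bins."""
    h = np.maximum(conv1d_same(embedded, kernel), 0.0)
    context, weights = self_attention(h)
    return context @ proj, weights

# Hypothetical sizes: 12 input steps, 16-dim embeddings, 32 hidden units, 80 mel bins.
rng = np.random.default_rng(0)
T, C_IN, C_HID, N_MELS = 12, 16, 32, 80
embedded = rng.normal(size=(T, C_IN))
kernel = rng.normal(size=(5, C_IN, C_HID)) * 0.1
proj = rng.normal(size=(C_HID, N_MELS)) * 0.1

mel, attn = predict_mel(embedded, kernel, proj)
print(mel.shape)
```

The sketch emits one mel frame per input step, which is a simplification: a practical synthesizer would produce many frames per input symbol and pass the resulting spectrogram to a vocoder to obtain the waveform.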

Similar resources

AN IMPROVED CONTROLLED CHAOTIC NEURAL NETWORK FOR PATTERN RECOGNITION

A sigmoid function is necessary for creating a chaotic neural network (CNN). In this paper, a new function for the CNN is proposed that can increase the speed of convergence. In the proposed method, we use a novel signal for controlling chaos. Both the theoretical analysis and computer simulation results show that the performance of the CNN can be improved remarkably by using our method. By means of this...

Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model

This paper proposes a deep neural network (DNN)-based statistical parametric speech synthesis system using an improved time-frequency trajectory excitation (ITFTE) model. The ITFTE model, which efficiently reduces the parametric redundancy of a TFTE model, improved the perceptual quality of the vocoding process and the estimation accuracy of the training process. However, there remain problems ...

An Improved BP Neural Network Algorithm Based on Factor Analysis

The back-propagation (BP) neural network, one of the most mature and widespread algorithms, is capable of large-scale computation and has unique advantages when dealing with nonlinear high-dimensional data. However, when high-dimensional data are processed with a BP neural network, the many feature variables provide enough information, but too many network inputs work against the design of the hidden-...

Traffic Prediction Based on Improved Neural Network

Artificial neural networks and genetic algorithms are derived from simulations of biology and anatomy. The paper analyzes the advantages and disadvantages of artificial neural networks and genetic algorithms, and combines the two in a prediction model. This method is used to predict traffic volume on a road; the accuracy of fore...

Phone-based speech synthesis with neural network and articulatory control

This paper presents a novel method for synthesizing speech signals using a phone-based concatenation approach. A neural network is employed for the generalization of the phone templates during synthesis. Simplified articulatory space input parameters based on a modified vowel diagram are used to provide flexible and effective articulatory control. It also enables the design of an articulatory control m...

Journal

Journal title: MATEC Web of Conferences

Year: 2021

ISSN: 2261-236X, 2274-7214

DOI: https://doi.org/10.1051/matecconf/202133606012